Marking essays on screen: An investigation into the reliability of marking extended subjective texts

Authors

  • Martin Johnson
  • Rita Nádas
  • John F. Bell
Abstract

There is a growing body of research literature that considers how the mode of assessment, either computer- or paper-based, might affect candidates’ performances (Paek, 2005). Despite this, there is a fairly narrow literature that shifts the focus of attention to those making assessment judgments and which considers issues of assessor consistency when dealing with extended textual answers in different modes. This research project explored whether the mode in which a set of extended essay texts were accessed and read systematically influenced the assessment judgments made about them. During the project twelve experienced English Literature assessors marked two matched samples of ninety essay exam scripts on screen and on paper. A variety of statistical methods were used to compare the reliability of the essay marks given by the assessors across modes. It was found that mode did not present a systematic influence on marking reliability. The analyses also compared examiners’ marks with a gold standard mark for each essay and found no shifts in the location of the standard of recognised attainment across modes.

Introduction

Literature suggests that the feasibility, validity and reliability of working on-screen have long been the focus of audiences from a wide range of backgrounds, ranging from studies in the contexts of education, to those in occupational and cognitive psychology, among others. This particular study sought to investigate whether the mode of marking (on-screen or paper) had any influence on essay marking reliability and markers’ leniency/rigour.

Literature Review

Bennett (2002) describes the rapid growth of computer technology use in workplaces and education as inexorable. Although technology offers the potential to broaden educational assessment beyond what traditional methods allow, there are inevitable concerns during a transition phase (where assessments exist in both paper- and computer-based modes) that their outcomes are not comparable.
In her review of comparability studies since 1993, Paek (2005) notes that the transition from paper- to computer-based testing cannot be taken for granted and that comparability between the two testing modes needs to be established through carefully designed empirical work. She goes on to suggest that:

“Comparability studies explore the possibility of differential effects due to the use of computer-based tests instead of paper-and-pencils tests. These studies help ensure that test score interpretations remain valid and that students are not disadvantaged in any way by taking a computerized test instead of the typical paper test.” (p.1)

Gathering reliability measures is one significant practical step towards demonstrating the validity of computer-based testing during the transitional phase.

Similar resources

Evidence Marking in Research Articles: An Investigation of its Sources and Relative Reliability through Quality Markers

Evidence occupies a paramount position in any logical endeavor and research article is consensually considered a predominant site of such an endeavor. One interesting area of rhetoric which addresses the source and reliability of evidence is quality metadiscourse. In this qualitative study, quality metadiscourse strategies (i.e., evidentials, hedges, boosters and disclaimers) are examined to in...

Markers’ criteria in assessing English essays: an exploratory study of the higher secondary school certificate (HSCC) in the Punjab province of Pakistan

Background: Marking of essays is mainly carried out by human raters who bring in their own subjective and idiosyncratic evaluation criteria, which sometimes lead to discrepancy. This discrepancy may in turn raise issues like reliability and fairness. The current research attempts to explore the evaluation criteria of markers on a national level high stakes examination conducted at 12th grade by...

Examining Negative Attitudes Toward Onscreen Marking in Hong Kong

This article details an investigation into onscreen marking (OSM) in Hong Kong — where paper-based marking (PBM) is being phased out, to be completely superseded by OSM. It is a specific follow-up to a larger study (Coniam, 2009a) involving 30 raters who had previously rated English language essay scripts on screen in the 2007 Hong Kong Certificate of Education Examination (HKCEE). In that stud...

The need for skin pen marking for sentinel lymph node biopsy: A comparative study

  Introduction: There is a consensus in the literature that sentinel lymph node biopsy is the standard procedure for axillary staging in early stage (I and II) breast cancer patients. Usually during lymphoscintigraphy, the location of the sentinel lymph node is marked on the skin by an indelible ink. In this study we evaluated this issue in our patients. Methods: 40 ...

Enhancing Skid Resistance of Two-Component Road Marking Paint using Mineral and Recycled Materials

Low skid resistance of road marking paint is one of the major issues in the safety of vehicle drivers, cyclists, and pedestrians when traveling on the city streets. Among the variety of marking paint, two-component paint is widely used at intersections and roundabouts. Therefore, the paint used should have adequate skid resistance. The object of this study was to evaluat...

Journal title:
  • BJET

Volume 41  Issue 

Pages  -

Publication date: 2010